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ABSTRACT 

Context. Despite its major role in the evolution of the interstellar medium, the formation of high-mass stars (M > 10 M Q ) remains 
poorly understood. Two types of massive star cluster precursors, the so-called massive dense cores (MDCs), have been observed, 
which differ in terms of their mid-infrared brightness. The origin of this difference has not yet been established and may be the result 
of evolution, density, geometry differences, or a combination of these. 

Aims. We compare several molecular tracers of physical conditions (hot cores, shocks) observed in a sample of mid-IR weakly emit- 
ting MDCs with previous results obtained in a sample of exclusively mid-IR bright MDCs. We attempt to understand the differences 
between these two types of object. 

Methods. We present single-dish observations of HDO, H' 8 0, SO2, and CH3OH lines at A = 1.3 - 3.5 mm. We study line profiles and 
estimate abundances of these molecules, and use a partial correlation method to search for trends in the results. 
Results. The detection rates of thermal emission lines are found to be very similar for both mid-IR quiet and bright objects. The 
abundances of HiO, HDO (10~ 13 to 10~ 9 in the cold outer envelopes), SO2 and CH3OH differ from source to source but independently 
of their mid-IR flux. In contrast, the methanol class I maser emission, a tracer of outflow shocks, is found to be strongly anti-correlated 
with the 12 p_m source brightnesses. 

Conclusions. The enhancement of the methanol maser emission in mid-IR quiet MDCs may be indicative of a more embedded na- 
ture. Since total masses are similar between the two samples, we suggest that the matter distribution is spherical around mid-IR quiet 
sources but flattened around mid-IR bright ones. In contrast, water emission is associated with objects containing a hot molecular core, 
irrespective of their mid-IR brightness. These results indicate that the mid-IR brightness of MDCs is an indicator of their evolutionary 
stage. 
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1. Introduction and are greater than the speed of sound (a s ^ 0.3 km-s -1 ). In 

addition, Jeans masses of massive star formation regions are 

High-mass stars play a significant role in shaping their host very low compared to their own masses (2.5 M for a mean 

galaxies, because of their high UV luminosity and mechanical temperature of 20 K, compa red to few hundre d solar masses, 

input (wind shocks and supernova?), but their formation remains e.g. the Cygnus-X sample by iMotte et~ai1l2007l) . implying that 

poorly understood. The scaling-up of existing models of low- a high level of fragmentation may play an important role. On 

• »-j , mass star formation dShu et all 1993b is unh elpful, because radi- the other hand, the magnetic field could limit thi s fragmentation 

• ation pressure would halt the accretion (e.g. lYorke & Sonnhalteil (IWard-Thompson et al]l2007b iGirart et aLll2009l) . A scenario of 

H \ 120021) leading to a final stellar mass of 10 M at most. To al- clustered massive star formation is now favoured, but its major 

. . , low the formation of more massive stars, the quasi-static see- steps have yet to be defined with new observations and studies, 
nario has been modified by increasing the mass accretion rate 

and including turbulence or energetic dynamics (Mc Kee & Tanl Searches for early stages of massive star formation have re- 

120031; iHenriksen et al.l 1 997b . Other approac hes, such as compet- vealed a class of objects, called hig h-mass protoste l lar ob jects 

itive a ccretion in massive prot ostar clusters (|Bonnell et al.ll 19971 (hereafter HMPOs) in the sample of iMolinari et al.l (1 19961) and 

120011) or protostellar mergers dBonnell et al.l 1998b may also pro- ISridharan et al.l (120021) . which are relatively extended (~lpc), 

vide an answer to this problem. The physical conditions in high- and contain one or several clumps called massive dense 

mass star formation sites indeed correspond to a dense and tur- cores (hereafter M DCs), as seen in the Cygnus-X sample by 

bulent medium where a monolithic infall onto a single massive IMotte et aD d2007l) . The main characteristics of MDCs are weak 

protostar is not expected. As an example, observed molecular radio emission from ionized gas (less than a few 10 mJy at 

emission line widths are al ways dominated by tur bulent veloc- 2.5 cm) despite a high total luminosity (L > 10 3 L ), a high 

ities (vj > 0.5 km-s -1 , e.g.. ISridharan et aT]l2002l) . and are ob- mass (> few hundreds of solar masses), and a high density 

served from large to small scales (see C 34 S lines observed by (« > 10 5 cm 4 ) in a typical size of 0.1 pc (~ 20000 AU). 

iLeurini et al.ll2007al with interferometrv in IRAS 05358+3543). They exhibit evidence of accretion activity: molecular bipo- 
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lar outflows (Beuther et al.ll200 2c). water and meth anol masers 
(IBeuther et all l2002dl). a nd large-scale i nflows dMotte et al" 
120031: iHerpin et al.ll2.Q09t) . As reviewed by IZinnecker & Yorke 
d2007l) . MDCs can be seen as proto-clusters that host the forma- 
tion of high-mass stars. 

MDCs are usually divided into mid-infrared 'quiet' 
and 'bright' sources ( hereafter mIRq and mIRb sources). 
Ivan der Tak et a define the division between them to 

be 10 Jy (12 |im flux density), whereas iMotte et al.l d2007l) as- 
sume a limit of 10 Jy for the MSX flux densities at 21.6 (J.m, 
in accordance with the distance of Cygnus-X. These values are 
equivalent to the classical limit between class and class I low- 
mass prot ostars obtained fr om the ratio of mid-IR to total lu- 
minosity (lAndre et al J 1 19931 as MDCs have similar total lumi- 
nosities. The link between mIRq and mIRb sources is not clear 
and the various mid-IR flux densities may be the consequence 
of different source orientations relative to t he line of sight, 
or differences in the gas de nsity distribution (Ivan der Tak et alj 
l2006HMarseiiie et al.l2008h . This last hypothesis is supported by 
the observation of d eveloped hot molecular cores (HMCs here- 
after) in MDCs {e.R IVan der Tak et al.ll2000l: IMotte et alJkOQBl: 
iLeurini etal ] l2007bh . On the other hand, chemical data indi- 
cate t hat mIRq sources ha ve a lower temperature than bright 
ones dMarseille et al.l 120081) . which imply they represent a less 
evolved stage. 

To more clearly define the evolutionary paths of mIRq 
and mIRb sources, this paper measures chemical, physical, 
and dynamical tracers i n a sample of te n mid-I Rq MDCs, 
extending the study by Ivan der Tak et al. | J2006h of mlRb- 
MDCs. We include water species (HDO and H^O), which 
can exhibit an abundance enhancement because of their re- 
lease in the gas phase from the dust grain surfaces, by 
mean s of a hot core, or shocks induced by bipolar outflows 
(e.g. ICernicharo et al.l Il990t Ivan Dishoeck & Helmichl Il996t 



lHarwitetal.lll998t iBoonman et al.l 120031) . Thus we probe the 
processes critically related to the formation of massive stars. To 
further investigate this approach, we focus on to hot cores and 
shock tracers using observations of a high-energy transition of 
SO2 and a methanol class I maser. We apply statistical tools to 
our sample (correlation and partial correlation factors) to con- 
strain biases and real trends in our results. 

The paper is arranged as follows. Section 2 presents the 
source sample, and Sect. 3 the new observations of HDO, H^O, 
SO2 and CH3OH molecular transitions. Section 4 checks for de- 
tections and describes the line profiles. Section 5 treats the case 
of the methanol class I maser emission, extracting the maser 
emission from the thermal emission. In Sect. 6, a modelling 
method is applied to extract molecular abundances, themselves 
analysed in Sect. 7 using statistical methods to find biases and 
real correlations between them and the physical characteristics 
of MDCs. In Sect. 8, we discuss our results and present conclu- 
sions about the class I maser emission behaviour, describe the 
differences and similarities between mIRq- and mIRb-MDCs, 
and present new ideas about their evolutionary stages. 

2. Source sample 

We observed ten MDCs with a low mid-IR brightness (see 
Table [T] fluxes have been corrected to assume a single dis- 
tance of 1 .7 kpc), taken from well-known sampl es (IBeuther et al.l 
l2002bt lMolinarietaI1ll996l: IMotte et al.l [20071) . Their physical 
properties incorporate a large range of masses (200-2000 M ) 
and luminosities (between 2xl0 3 and 3xl0 4 L Q ) within a typ- 
ical size of ~ 0.1 pc. All are dense («h, - 10 5 - 10 7 cm -3 ), 



most of them (8 in total) exhibiting water and/or methanol 
class II masers, and bipolar outflows. Chosen MDCs are weak 
at cm wavelengths (except for IRAS 05358+3543, which ex- 
hibits an UC-Hn region), making them good candidates to rep- 
resent an early stage of the formation of massive stars. Other 
studies infer a high accretion rate for these sources (~ 10~ 3 - 
10~ 4 M -yr~') derived from the power of their molecular out- 
flows, a nd a clustered formation process observed by interfer- 
omet r y dZapata et alj|2006h ILeurini et al.ll2007ah iMolinari et alj 



l2008UBontemps et al.ll2009l According to IRAS or MSX data, 
these sources emit between 0.2 and 22 Jy at 12 urn. This sam- 
ple is complementary to the one observed by Ivan der Tak et all 
(120061) . which contained mIRb-MDCs exclusively. 



3. Observations 

We performed observations at the Pico Veleta with the IRAM 
30m antenna^ in June 2006 and February 2009. Observations 
were performed while the weather was good (t£ m ~ 0.07 - 0.6 
at 225 GHz). We achieved system temperatures of 150-190 K 
for lower frequency transitions and 600-1600 K for higher ones 
(see Table[2]fbr further details). 

We used simultaneously the A100, A230, B100, and B230 
receivers to cover five molecular emission lines: HDO (1 1,0- 
loi) and (3i2-2 2 i) at 80.6 GHz and 225.9 GHz respectively, 
CH 3 OH(5i-4 ) at 84.5 GHz, H* 8 (3i,3-2 2 ,o), and SO 2 (12 ,i 2 - 
1 1 i,n) at 203.4 GHz. We tuned the VESPA correlator to obtain 
spectral resolutions of 20 and 40 kHz, hence velocity resolu- 
tions of between 50 and 80 m-s -1 (see Table[2]for details). We 
observed sources in a single point mode, pointing on the conti- 
nuum peak emission at millimetre wavelengths. We chose the 
wobbler-switching observation mode as the sources are reason- 
ably isolated, using a throw of 4' to the west. Pointing and focus 
calibrations were set on Jupiter and Neptune. The pointing accu- 
racy was of the order of 2". 

We performed the da ta reduction with the softw are CLASS 
from the GILDAS suite (iGuilloteau & Lucasll2000l) . After sub- 
tracting linear or polynomial baselines, we summed spectra ac- 
cording to the position and frequency. We converted antenna 
temperature T* into main beam temperature T m \, using the effi- 
ciency parameter 77 m b provided by IRAM (see Table|2|. We mea- 
sured the line parameters (central velocity, width, peak intensity, 
and line flux) by fitting one or multiple Gaussian profiles. 



4. Results 

4. 1 . Spectral line detections 

The methanol emission is detected towards all sources at a 
signal-to-noise ratio of 15 or more, with a peak T m \, varying from 
0.6 to ~ 40 K (see Fig. [TJ. For all sources, we detect wings or 
multiple velocity components. Unlike CH3OH, the S0 2 lines are 
detected in 7 sources only and have a Gaussian shape. The H' s O 
line is seen toward 3 sources, W43-MM1, DR21(OH), and IRAS 
18089-1732; the line profiles exhibit a Gaussian shape, except 
for DR21(OH) where the observed signal is more complex (see 
Fig. [3] and following section). The HDO line emission at 1 mm 
and 3 mm is detected toward 4 sources: W43-MM1, DR21(OH), 
IRAS 18089-1732, andNGC 7538S. The HDO lines are always 



1 IRAM is an international institute for research in millimetre astron- 
omy, co-founded by the Centre National de la Recherche Scientifique 
(France), the Max Planck Gesellschaft (Germany) and the Instituto 
Geografico Nacional (Spain) 
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Table 1. Source sample. 
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# 


Source 




5 




D 


L 


M 




CH-iOH/H^O 


L 12/im 


Ref. 






(J2000) 


(J2000) 


(km-s- 1 ) 


(kpc) 


(xl0 4 L o ) 


(M Q ) 


(pc) 


masers 


(Jy) 




1 


IRAS 05358+3543 


05"39 m 10.4 s 


+35°45'19" 


-17.6 


1.8 


0.7 


400 


0.10 


yes/yes 


6.29 


1,2 


2 


IRAS 18089-1732 


18 h ll m 51.5 s 


-17°31'29" 


+33.8 


3.6 


3.2 


1000 


0.14 


yes/yes 


27.27 


1.3 


3 


IRAS 18151-1208 MM1 


18 h 17 m 58.0 s 


-12°07'27" 


+33.4 


3.0 


1.4 


330 


0.13 


yes/no 


67.89 


4 


4 


IRAS 18151-1208 MM2 


18 h 17 m 50.4 s 


-12°07'55" 


+29.7 


3.0 


0.3 


230 


0.11 


no/yes 


0.65 


4 


5 


IRAS 18264-1152 


18 h 29 m 14.3 s 


-11°50'23" 


+43.6 


3.5 


1.4 


1200 


0.13 


yes/yes 


12.25 


1,2,5 


6 


W43-MM1 


18 h 47 m 47.0 s 


-01°54'28" 


+98.8 


5.5 


2.3 


2000 


0.12 


yes/yes 


11.51 


2.6 


7 


DR21(OH) 


20 h 39 m 00.8 s 


+42°22'48" 


-4.5 


1.7 


0.2 


450 


0.13 




3.10 


7 


8 


DR21(OH)-S 


20 h 39 m 01.5 s 


+42° 22' 04" 


-3.1 


1.7 


0.4 


200 


0.13 




0.88 


7 


9 


NGC 7538S 


23 h 13 m 44.5 s 


+61° 26' 50" 


-57.0 


2.8 


1.3 


400 


0.14 


yes/yes 


4.61 


8,9 


10 


IRAS 23385+6053 


23 h 40 m 53.3 s 


+61°10'19" 


-49.0 


4.9 


1.6 


370 


0.05 


.../yes 


41.96 


1,6,10 



Refere nces. (DlBeuther et al.l d2002<d): (2)lHerpin et alj j200gh: (3)lLeurini et alj 1 2007bl); (4) Marseille et all d2008l): (5)ISzvmczak et all 
d2000h: lEdris et al.l (120071): (6) iMolinari et all dl996h: (7) iMotte et all d2007h : (8) Isandell et alj d2003h : (9) iBarvainis & Deguchil dl989l) : 
ICernicharo et alj 1 1990); Haschi ck et alj d 19891) : (10) Molin ari et alj dl998l) . 



Notes. Columns 2-10 present coordinates, velocities in the local standard of rest, heliocentric distances, luminosities, mass estimates and 
source sizes at millimetre wavelength of the sources, extracted from references given in the last column. 



Table 2. Molecular lines observed. 



Species 


Transition 


Frequency 


f 

'-up 


HPBW 




Receiver 


6v a 


r sys 






(GHz) 


(K) 


(") 






(m-s- 1 ) 


(K) 


HDO 


1 1.0 - * lo.i 


80.5783 


47 


31 


0.79 


A100 


74 


140-190 




3 1.2 — > 2 2 + 


225.8967 


168 


11 


0.54 


B230 


53 


750-1600 


H' s O 


3 1,3 — > 22.0 


203.4075 


204 


12 


0.57 


A230 


59 


600-1300 


SO, 


12o,12 — * 11 1,11 


203.3916 


70 


12 


0.57 


A230 


59 


600-1300 


CH 3 OH 


5-i,5 - > 4o,4E 


84.5212 


40 


29 


0.78 


B100 


71 


140-190 



Notes. Columns 2-9 indicate energy level transitions, line emission rest frequencies, half power beam width (HPBW), main beam efficiency 
rj m b, receiver name, velocity resolution 6v, and system temperature r sys . 
(fl) Values for the VESPA backend. 



either detected or both non-detected (see Fig. [2]). We do not de- 
tect special features in the line profiles, except for DR21(OH) 
whose line profile is most closely fitted by a two-component ve- 
locity model. Measured parameters of single peaked lines are 
listed in Table[3] Methanol and other special features (such as the 
two velocity components in DR21(OH)) are treated in Sect. 14.21 



The 1.3 mm spectrum of W43-MM1 shows detections of the 
expected transitions mixed up with additional lines that we have 
identified (see Fig. |4j». First, emission line from H^O is blended 
with an emission line of the CH3OCH3 molecule at 203.408 GHz 
(see Table |4}. We also detect a mixing between the SO2 tran- 
sition and emission at the frequency of 203.39 2 GHz (maybe 
C3H7CN, as discovered bv lBelloche et al .1120091) . This emission 
seems to be real, as supported by its velocity width that is sim- 
ilar to other lines (see Table |4|. In the same spectrum, three 
other emission lines from CH3OCH3 are clearly detected. All 
the lines have a Gaussian shape. The 3 mm spectrum of W43- 
MM1 shows an unexpected feature near the CH3OH line, com- 
ing from the NH2CHO molecule (see Table [4]). One also notes 
in IRAS 18089-1732 and DR21(OH) that the same group of 
CH3OCH3 lines seen in the W43-MM1 spectrum are detected. 
One of them is blended with H^O. 



Table 4. Characteristics of the unexpected emission lines de- 
tected in W43-MM1. 



Species 


Frequency 






Av b 


J Tdv 




(GHz) 


(K) 


(mK) 


(km-s -1 ) 


(K-km-s- 1 ) 


NH,CHO 


84.542 


7 


190 


6.51(9) 


1.35(2) 


CH3OCH3 


203.383 


9 


430 


5.3(5) 


2.5(3) 


CH3OCH3 


203.406 


113 


430 


3.8(7) 


1.8(2) 


CH3OCH3 


203.418 


9 


370 


5.3(4) 


2.2(3) 


CH3CHO" 


211.936 


444 


260 


6(1) 


1.6(3) 


CH3CHO 


211.957 


216 


210 


6(2) 


1.3(4) 



Notes. Numbers in brackets indicate the error bar associated to the last 
decimal written. 

<<!) Uncertain assignment due to the high level of the upp er energy. It 
could be C3H7CN, as discovered bv lBelloche et al.ll2009l or a SO2 ex- 
tension of the main emission at An = -6.2 km-s~' induced by outflows. 
The line emission velocity width is measured at the FHWM. 



4.2. Line profiles 

For all sources, except W43-MM1, the methanol emission line 
profiles exhibit wings (Fig. 1). Some of them are seen in both 
the blue and red-shifted parts (IRAS 05358+3543, IRAS 18151- 
1208 1, IRAS 18151-1208 2, NGC 7538S), and others only on 
one side (blue-shifted for IRAS 23385+6053, red-shifted for 
IRAS 18264-1 152 and DR21(OH)S). 
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60 100 120 Velocity (km/s) 

Velocity (km/s) 




Fig. 2. Observations of the HDO (1-1) (top) and HDO (3-2) (bottom) emission lines from W43-MM1 (left) and NGC 7538S (right). 
For a clearer view, spectra have been smoothed by reducing the velocity resolution by a factor 2 (see Table[3]for the initial value). 

Table 3. Characteristics of observed lines. 



Source 


Species 


Transition 


"lsr 

(km-s^ 1 ) 


7mb° 

(mK) 


Av b 
(km-s- 1 ) 


J Tdv 
(K-km-s- 1 ) 


cr rmK 

(mK) 


IRAS 05358+3543 


HDO 


J = 1 1 ,o - lo,i 










20 




HDO 


J = 3, i - 2, i 










40 




h; s o 


/=3i,3-2 M 










30 




so 2 


J = 12o,i 2 - 1 1+n 


-15.9(4) 


90 


6.6(5) 


0.61(4) 


30 


IRAS 18089-1732 


HDO 


J = 1 1 ,o - If),] 


32.1(1) 


130 


4.5(3) 


0.62(3) 


10 




HDO 


J = h.i - 2 2 ,i 


32.7(1) 


420 


6.3(3) 


2.8(1) 


50 




Hi s O 


/=3i,3-2 2 , 


32.7(2) 


180 


5.7(2) 


1.09(7) 


30 




S0 2 


J = 12oj 2 - 1 1 1. ii 


32.9(4) 


430 


9.5(4) 


4.4(1) 


30 


IRAS 18151-1208-MM1 


HDO 


/= 1 1 .o - lo,i 










20 




HDO 


I = 3 2 1 - 2 2 , 










50 




Hi s O 


J = 3 U -2 M 










40 




so 2 


J = 12oj 2 - 1 1 1. ii 


33.3(3) 


SO 


4.2(8) 


0.36(5) 


40 


IRAS 18151-1208-MM2 


HDO 


J = 1 i,o - lo,i 










20 




HDO 


J = h,\ -2 2 ,i 










60 




H^O 


/ = 3i| 3 -2 2 ] 










50 




so 2 


J = 12o,12 - 1 1 1. 11 










50 


IRAS 18264-1152 


HDO 


J = 1 1 .o - lo,i 










60 




HDO 


J = h,i - 2 2 ,i 










330 




Hi s O 


/ = 3i| 3 -2 2 ] 










290 




so 2 


^ = 12o,i 2 - lli.n 










290 


W43-MM1 


HDO 


J = 1 i.o - lo,i 


101.3(1) 


107 


5.7(2) 


0.65(2) 


20 




HDO 


J = 3 2 ,i - 2 2 ,i 


100.4(1) 


410 


7.5(2) 


3.24(9) 


60 




Hi 8 


■/ = 3 u -2 2 ,o 


100.6(5) 


220 


3.8(7) 


1.1(3) 


80 




so 2 


^ = 12o,i 2 - 1 li.ii 


100.7(3) 


230 


5.2(7) 


1.29(6) 


80 


DR21(OH)-S 


HDO 


J = 1 i.o - lo.i 










20 




HDO 


= 32,1 - 2 2 ,i 










30 




Hi 8 


^ = 3,j-2 2 . 










40 




so 2 


J = 12o,12 - 1 1 1.11 


-3.5(1) 


240 


3.5(2) 


0.87(3) 


10 


NGC 7538S 


HDO 


J = 1 1 .o — lo.i 


-56.7(5) 


20 


6(1) 


0.11(2) 


20 




HDO 


J = 3 2 ,i - 2 2 ,i 


-57.9(8) 


20 


10(2) 


0.57(9) 


50 




Hi 8 


^ = 3,j-2 2 . 










50 




so 2 


J = 12o,i 2 - 1 li.ii 


-55.7(1) 


580 


6.1(1) 


3.71(5) 


50 


IRAS 23385+6053 


HDO 


J = 1 i.o - lo,i 










10 




HDO 


■/ = 32.1 - 22.1 










60 




h; s o 


/= 3 u -22,o 










30 




S0 2 


J = 12o.l2 - Hi. 11 


-50.3(6) 


40 


8(2) 


0.35(5) 


10 



Notes. Lines are fitted with a single Gaussian profile. Numbers in brackets indicates the error bar associated with the last decimal written. 
DR21(OH) is not reported here, having its own dedicated table. 

(a) Conversion factor is S /T mb = 4.95 Jy/K for IRAM 30m telescope. (6) The line emission velocity width is measured at the full width half 
maximum (FWHM). 
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EH 10 
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i i i i | i i i i 


I18151-MM2 J 


V - 


DR21-(0H)^^^^ 


— 1, 




^ \ 


DR21-(0H)j^y 


\ 


118264 ^v^A^^ 


\ 






118089^^ 


V 


123385^^ 


V 


118151-MMl^^^ 










x 44.3 K 



x 5.21 K 



x 4.42 K 



x 4.51 K 



x 1.70 K 



x 0.57 K 



10 



v-v lsr [km/s] 



10 



Fig. 1. Line profiles of the CH3OH emission observed in our 
sample. Profiles have been offset by V\s t (see Table [T]) and nor- 
malized to unity with a factor indicated on the right part of the 
panel. 



In addition to their wings, the methanol lines of- 
ten show multiple velocity components. This is seen in 
IRAS 05358+3543, IRAS 18089-1732, IRAS 18151-1208- 
MM2, W43-MM1, and DR21(OH). The number of these com- 
ponents varies between 3 and 4 and their strength from ~ 1 to 
40 K. 

In DR21(OH), the 1.3 mm HDO transition exhibits a well 
defined two-component velocity pattern. A double Gaussian fit 
reproduces the observed line with the parameters reported in 
Table [5] The 3 mm HDO spectrum does not show the two ve- 
locity peaks seen at 1.3 mm. A single velocity Gaussian profile 
provides a good fit but only with a rather large (> 10 km-s -1 ) line 
width. We find that a more convincing approach is to analyse this 
line with a two-line Gaussian model using fixed velocities and 
widths derived from the 1.3 mm line, obtaining a good result. 
The fit parameters are reported in Table[5] In the H' 8 spectrum, 
dimethyl ether and SO2 lines are clearly detected. A close exam- 
ination of all dimethyl ether lines reveals that not all lines are 
well fitted with a single velocity component model because lines 
are too wide compared to 1.3 mm HDO components. We choose 
to fit those lines with a two-component model derived from the 
peak velocities observed in the methanol spectrum. We can then 
remove from the spectrum the dimethyl ether contribution. The 
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DR21(OH) with their best double-Gaussian fit. Double compo- 
nent fit of H^O must be considered as tentative to be consistent 
with HDO line profiles. Profiles are smoothed by a factor of 5 
x 0.53 K for a better view. 
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Fig. 4. Spectrum of Hj 8 and SO2 emission lines from W43- 
MM1, showing the serendipitous detection of CH3OCH3, 
C3H7CN and NH2CHO lines. Spectra are smoothed by reduc- 
ing the velocity resolution by a factor 8 (see Tab.[3]for the initial 
value). 



resulting spectrum shows a signal well above the noise in the 
range expected for the H^O transition. The higher velocity part 
is around the 2.9 km-s" 1 velocity observed in the 1.3 mm HDO 
spectrum and is well above the noise. It is also well separated 
from the dimethyl ether line (no blending). On the other hand, 
the lower velocity part, around the -3.2 km- s _1 component of the 
HDO line is partly blended by dimethyl ether and the signal-to- 
noise ratio is lower. Fit results are given in Table [5] 

To estimate the Hl s O emission line strength, and be consis- 
tent with the observed HDO line profiles, the CH3OCH3 lines 
were first subtracted from the spectrum using a model with 
two velocity components. The CH3OCH3 and water emissions 
are expected to originate in the same region because these two 
species are chemically linked. We then tentatively applied the 
velocity component values to fit the H^O emission. The result- 
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Table 5. Components in the line profile of DR21(OH). 



Species 


Transition 


Component 


V 




An" 








Ckm-S- 1 ) 


(mK) 


(km-s^ 1 ) 


HDO 


J = 1 1,0 - lo.i 


1 


-3.2(4) 


120 


8(1) 




J = 1 i.o - lo.i 


2 


2.9(1) 


140 


4.2(3) 


HDO 


■/ = 3 2 ,1 -22.1 


1 


-3.2(1) 


25 


8(1) 




/ = 32,1 -22,! 


2 


2.9(4) 


22 


4.2(3) 




J = 3 1,3 - 22.0 


1 


-2.4(2) 


40 


1.7(6) 




J = 3i.3 - 2 2 .o 


2 


3.5(1) 


60 


8(1) 


S0 2 


J = 12o,i2 - Hi. 11 


1 


-3.3(1) 


640 


5.6(1) 




/=12o,i 2 -Hi,ii 


2 


-0.7(1) 


720 


8.0(1) 



Notes. Table presents HDO line components at 3 mm (bottom), 1 .3 mm 
(top), H' 8 0, and S0 2 in DR21(OH). Numbers in brackets indicates the 
error bar associated with the last decimal written. 
(< " The line emission velocity width is measured at the FWHM. Values 
given for the H^O blue-shifted component are extracted for the best fit 
Gaussian function, which remains quite uncertain. 



ing line emission parameters are reported in Table [5] showing 
a blue-shifted component that remains doubtful, even if it can- 
not be completely rejected (see Fig. [3}. The SO2 line is detected 
but a single velocity component fit does not provide a satisfy- 
ing profile, where a two-velocity fit constrained by the velocity 
difference (taken from the methanol velocities) gives a slightly 
better result (see Table 0. The remaining red wing on the SO2 
line is mainly related to the dimethyl ether line. The remaining 
blue signal on the foot of SO2 cannot be identified, but is weak 
(70 mK). 

5. Methanol maser emission 

The CH3OH (7=5_i 5-4o,4E) transition is often detected as a so- 
called class I maser. Unlike class II masers, which originate 
in the immediate protostellar environment (<10 3 AU), class I 
masers arise at some distance (>10 4 AU) from the protostar. 
Here, shocks between jets and ambient gas cause local den- 
sity and temper ature enhancement s where population inversions 
may occur (e.g. lMinier et al.l20 05). As a consequence, the maser 
emission lines are very sensitive to the local environment in 
which they appear and typically observed to have sharply peaked 
profile that can be distinguished from thermal, turbulent, or out- 
flow emissions. 

In our observations, the line profiles show that the maser and 
thermal emissions are mixed in different proportions. As we aim 
to trace shocks, we need to extract maser emission to ensure a re- 
liably determination of any relation between the supposed evolu- 
tionary stages of our sources and the strengths of the shocks that 
may occur insid e. An identical analysis i s performed with pro- 
files obtained by Ivan der Tak et al. I (l2006h to extend our study. 

5. 1 . Extraction method 

To disentangle the thermal and maser contributions to the 
CH3OH line profi les, we use the extraction method of 
ICaswell et al.l(l2000l) . The strong turbulence within these sources 
is also relevant a role too, leading to a larger width of the 
Gaussian line profile. This is why we enclose its effect under 
the term "thermal" used in this paper. We fit the MDC's en- 
velope contribution in each met hanol profile by includin g the 
sources previously observed by Ivan der Tak et al ] (I2006I) . The 
thermal emission is then fitted by a Gaussian component centred 
on the source velocity, an assumption motivated by all observa- 
tions of class II or class I methanol masers (e.g. ICaswell et alJ 



1 19951 IVal'tts et alJl2000l iBIaszkiewicz & Kusll2004 exhibiting 
maser line profiles that are sharply peaked and differ from a 
Gaussian shape. If the line becomes saturated, the maser profile 
is of course n o longer narrow and the wings grow exponentially 
dElitzurll 19921) but this will never produce a "thermally-looking" 
profile. The possibility of blends of emission originating in mul- 
tiple maser spots within the telescope beam is also real, and our 
assumption cannot be definitely verified until future interferom- 
eter observations are made. 

Once the maser emission has been extracted, we measure its 
velocity-integrated area A maser . We also compute the area A me rm 
of the thermal emission and derive the ratio X m = A maser /A me rm 
of these two quantities, to determine the sources in which the 
maser emission dominates. 

5.2. Results 

The results of the CH3OH class I maser emission extraction are 
given in Table [6] includ ing results for the profiles observed by 
Ivan der Tak et all (120061 ). 

The maser emission is dominant (X m > 10) in two 
sources, IRAS 18151-1208-MM2 and DR21(OH). The very 
high emission intensity in these objects (T pesi k = 44.3 and 
5.21 K, respectively), and the structure of their profiles (see 
Fig. [D are consistent with this finding. A group of four 
sources (IRAS 05358+3543, IRAS 18089-1732, NGC 7538S, 
and IRAS 23385+6053) exhibit distinctive maser emission, 
(X m ~l-2). A majority of objects exhibit weak maser emission 
(X m ~ 0.1 -1.0): IRAS 18151-1208-MM1JRAS 18264-1152, 
W43-MM1, DR21(OH)-S, W33A, AFGL 2591, S140 IRS1, 
NGC 7538 IRS1, and NGC 7538 IRS9. In the three remaining 
objects, W3 IRS5, AFGL 2136, and AFGL 490, the maser emis- 
sion seems to have disappeared and our measurements must be 
treated as upper limits (see Tabled- 

6. Molecular abundances 

6.1. Modelling method 

To derive molecular abundances from o ur observations, we use a 
global modellin g process developed by Ivan der Tak et al. I (fl999l) 
and improved bv lMarseille et al I (l2008[). We first model the SED 
with the MC3D program (IWolf et al.lll999h to derive the mass 
and the temperature distributions inside the MDCs. The source 
model, i.e., its total luminosity, size, and density distribution 
is constrained by observa tions (see Tab l es Q] and [7]), and uses 
dust properties derived by iDraine &Led dl984l) . We then trans- 
fer th e density and temperature distrib ution to the RATRAN 
code dHogerheijde & van der Tata l2000h . This non-LTE radia- 
tive transfer code models the molecular line emissions observed, 
and thus we obtain the molecular abundances and the turbulence 
level, which are the two free parameters of this method. 

The density distribution is characterized by a power law of 
the form 



n(r) = no — 
^0 



(1) 



where no is the density at the reference radius ro (here 100 AU) 
obtained by a fit to the optically thin emission of dust at mil- 
limetre wavelengths. The parameter p is derived from contin- 
uum maps of MDCs or is set to be 1 .5 accor ding to a quasi- 
static infall theory in the inner part of the object (IShu et all 1987b 
iBeuther et al.ll2002cl ; see Table |7). The density and tempera- 
ture distributions are transferred to RATRAN when modelling 
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Table 6. Results of the CH3OH class I maser emission extraction. 









Thermal 






Maser 






4+ 

# 


Source 




Ay 


A 

Ah 


-/ ■ 


Vm 


A 


A m 






(R> 


(Km-s ) 


(K'Km-s 


) 


(Km-s ) 


(K-Km-s ) 




1 


IRAS 05358+3543 


0.48 


5.0 


2.55 


1.7/1.2 


-16.5/-15.2 


3.94 


1.54 


2 


IRAS 18089-1732 


0.59 


3.6 


2.26 


1.5/0.9 


31.6/32.6 


3.20 


1.42 


3 


IRAS 18151-1208-MM1 


0.44 


2.2 


1.03 


0.3/0.2/0.1 


32.4/33.9/31.0 


0.72 


0.70 


4 


IRAS 18151-1208-MM2 


0.70 


5.0 


3.72 


43.8/37.4/3.8 


29.9/30.5/31.6 


639.5 


172 


5 


IRAS 18264-1152 


1.20 


3.8 


4.85 


1.0/0.4 


43.8/41.3 


2.68 


0.55 


6 


W43-MM1 


10.9 


4.2 


53.9 


1.5/1.2 


102.0/96.8 


9.53 


0.18 


7 


DR21(OH) 


1.94 


2.2 


4.84 


5.1/3.1/1.5/0.4 


0.9/-1.2/-3.1/-6.5 


537.35 


64 


8 


DR21(OH)-S 


3.50 


2.2 


8.19 


2.4/1.8/0.2 


-4.3/-5.2/-0.8 


6.78 


0.83 


9 


NGC 7538S 


1.81 


3.1 


5.94 


2.8/2.2 


-55.1/-53.3 


13.2 


2.22 


10 


IRAS 23385+6053 


0.25 


2.6 


0.79 


0.4/0.3 


-50.3/-51.8 


1.29 


1.86 


a 


W3 IRS 5 


0.07 


2.6 


0.18 






<5.4xlO~ J 


<0.03 


b 


AFGL 490 


0.14 


2.4 


0.36 


a 




<3.6xl0 -2 


<0.10 


c 


W33A 


1.51 


3.5 


5.59 


0.4/0.2 


35.2/39.5 


2.74 


0.49 


d 


AFGL 2136 


0.17 


2.9 


0.52 


a 




<6xl0 -4 


<0.0012 


c 


AFGL 2591 


0.39 


2.9 


1.19 


0.2/0.1 


-5.9/-8.6 


0.46 


0.38 


f 


S 140 IRS 1 


0.43 


2.1 


0.95 


0.2/0.1 


-5.7/-8.2 


0.55 


0.57 


g 


NGC 75 3 8 IRS 1 


0.68 


2.9 


1.85 


0.3 


-56.2 


0.51 


0.27 


h 


NGC 7538 IRS9 


0.64 


2.0 


1.35 


0.4/0.2 


-56.3/-55.5 


1.00 


0.74 



Notes. The table presents parameters of the main thermal emission fitted by a Gaussian (temperatures 7^,, velocity widths at FHWM Ad and area 
Aherm), the resulting maser emission peaks characteristics (temperatures T m and velocities u m ), and the area ratios (X m ). 
{a) No methanol maser features detected over the 3<x rms noise threshold. 



the emission lines. The best fit is obtained for a given gas tur- 
bulence velocity uj and a molecular abundance relative to H2 
(Xnoi = «moi/"H,)- When an emission line is not detected, we 
derive an upper limit assuming a line velocity width similar to 
the SO2 one (if available, otherwise we select the thermal com- 
ponent of the methanol line) and refer to the abundance required 
for a 2<x rms signal. 

6.2. Results 

The HDO abundances obtained are between 10 and 10~ 9 
when the line emission is detected (IRAS 18089-1732, 
W43-MM1, DR21(OH), and NGC 7538S; see Table HQ. For 
other sources, the upper limits are between 10~ 13 and 10~ n . 
IRAS 23385+6053 is an exception with the high upper limit of 
1.5 x 10 9 for the HDO (3i, 2 -2 2 ,i) transition. This high value 
(see Table [8} is mainly due to the compact shape of the source 
(0.05 pc compared with the size of the other sources, i.e. ~ 
0.13 pc) that increases the beam dilution for high-excitation 
lines. Furthermore, IRAS 23385+6053 is a very dense and hot 
source (see Table [7) where a high abundance of water species is 
needed to reach the level of the critical density, permitting us to 
avoid strong self-absorption of the line emission (particularly for 
high-energy transitions), and explaining our high upper limit. 

When the emission line of H^O is detected (i.e. in 
IRAS 1 8089- 1732, W43-MM 1 , and DR2 1 (OH)), its abundance 
is inferred to be between 10~ n and 10~ 9 , giving a H^O abun- 
dance estimation between 5xl0~ 8 and 5xl0~ 7 , assuming a so- 
lar isotopic ratio 16 0/ 18 = 500. Other sources give an upper 
limit between 10~ 13 and 10~ 12 , hence a main water isotope abun- 
dance between 5xl0~ n and 5xl0~ 10 . This result seems to indi- 
cate a low water abund ance in the MDC concern ed, relative to 
the values measured by Ivan der Tak et al ] (120061) (see Tables [8] 
and [9}, except for W43-MM1. The comparison of these abun- 
dances with those obtained with the high-energy transition of 
HDO (£u P = 168 K, hence close to the H ls O transition con- 



sidered) show that they are consistent and higher than the low- 
energy transition of HDO (£ up = 47 K), except W43-MM1 (see 
Sect. 8 for a discussion of this result). 

The HDO and H} s O abundances that we derive permit 
us to estimate the HDO/H2O ratio in MDCs. Assuming that 
16 0/ 18 = 500, the ratios are 33xl0" 4 , 7xl0~ 4 , and 19xl0~ 4 for 
IRAS 18089-1732, W43-MM1, and DR21(OH), respectively. 
These values are obtained from high-energy transition results 
(see Table |9). The D/H ra tio agrees with the values obtained 
by Ivan der Tak et ail (120061) . giving values between 3x10 4 and 
38xl0~ 4 . Our results and previous ones are reported in Table[9] 

The CH3OH abundances are derived from the thermal emis- 
sion components and are spread between 10~ 10 and 10 -8 , 
i.e. of the same order of magnitude as those derived by 
Ivan der Tak et al.1 (120061) for mIRb MDCs. We note that W43- 
MM1 has the highest abundance in the sample, with a value 
reaching 5.4xl0~ 7 . 

The modelling of the SO2 emission line implies an 
abundance of between 10 and 10~ 10 (see Table HQ, which 
confir m s previous est i mation s in MDCs (Ivan der Tak et al.l 
[20031: IWakelam et all 120041) . In particular, W43-MM1, 
IRAS 05 358+3543, and IR AS 18264-1152 were already 
studied in iHerpin et alJ d2009l) . where a multi-transition evalua- 
tion of abundances is performed. For these sources, our results 
agree with their estimations. Even in IRAS 18264-1 152, where 
an upper limit has been derived, our result is in accordance with 
the abundance estimated in their study. 

We derive turbulent velocities higher than the speed of sound 
(uj = 0.85 - 2.9 km-s -1 compared to a s = 0.3 - 0.5 km-s -1 ), 
confirming this MDC characteristic. 

A large number of sources in our sample are found to have a 
low upper limit to their Hi s O abundances (see Table[8]). We sus- 
pect that most of the water is frozen in a solid state on the surface 
on dust grains in these cases. As for HDO, IRAS 23385+6053 
is an exception but this result can be easily explained (see 
second paragraph). At the same time, some sources have a 
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Table 7. Modelling parameters. 



# 




n 


P 


vv 




T. 
*0 


(T\ 
K 1 / 






( Qrains m ^ 

\ iLL (\LllJ . Ill 1 




v> m ) 








1 


IRAS 05358+3543 


1.5xl0 j 


1.4 


8.4X10 5 


-0.62 


562 


31.1 


2 


IRAS 18089-1732 


7.4xl0 4 


1.5 


6.1xl0 7 


-0.46 


407 


36.1 


a 
i 


TD AO 10K1 n/*Q 1 /T1V T 1 

IKAo 181 J 1 — lzUo-JVUVLl 


o. /X 1U 


1 .1 


J . J X 1 u 


-U.OU 


oil 


32.J 


4 


IRAS 18151-1208-MM2 


7.5xl0 2 


1.3 


6.5xl0 5 


-0.61 


427 


24.2 


5 


IRAS 18264-1152 


4.0xl0 3 


1.5 


9.8X10 5 


-0.61 


676 


32.6 


6 


W43-MM1 


2.0x10 s 


2.0 


5.5x10" 


-0.54 


302 


20.0 


7 


DR21(OH) 


4.1xl0 4 


1.5 


3.5xl0 7 


-0.44 


191 


20.7 


8 


DR21(OH)-S 


1.8xl0 4 


1.5 


1.5xl0 7 


-0.43 


224 


25.3 


9 


NGC 7538S 


2.9xl0 4 


1.5 


2.4xl0 7 


-0.45 


316 


30.4 


10 


IRAS 23385+6053 


5.4xl0 5 


1.5 


5.0xl0 8 


-0.42 


347 


38.7 



Notes. Table presents the source name, the number of dust grains « per meter cube at r = 100 AU (following a standard MRN distribution in 
size and types), the power-law index p, the mean H2 density in the source, the best-fit temperature distribution coefficients (T = 7o(r/ro) ff ) and the 
mean temperature in the source. 

Table 8. Modelling results. 

X mol 2v T (km-s- 1 ) 



Source 


HDO 


H»0 


S0 2 


CH 3 OH 


HDO 




S0 2 


CH 3 OH 




3 mm 


1 mm 








3 mm 


1 mm 








IRAS 05358+3543 


< 3.8(-12) 


< 7.6C-13) 


<4.8(-13) 


5.K-11) 


1.3(-9) 








2.4 


2.9 


IRAS 18089-1732 


2.0(-ll) 


1.3(-10) 


7.9(-ll) 


4.2(-ll) 


1.3(-10) 


2.4 


4.2 


5.2 


5.8 


2.1 


IRAS 18151-1208-MM1 


<5.8(-13) 


< 2.2(-13) 


< 5.2(-13) 


1.2(-10) 


1.6(-9) 








2.5 


1.4 


IRAS 18151-1208-MM2 


<2.0(-12) 


< 1.1(-12) 


< 1.8(-12) 


< 1.2(-10) 


1.0(-8) 










3.2 


IRAS 18264-1152 


<2.3(-12) 


< 1.6(-12) 


< 1.1(-12) 


< 1.2(-10) 


3.2(-9) 










2.4 


W43-MM1 


1.5(-9) 


1.7(-9) 


4.6(-9) 


2.4(-10) 


4.8(-7) 


2.9 


2.9 


2.7 


3.3 


4.2 


DR21(OH) n 


6.1(-13) 


8.0(-ll) 


8.5(-ll) 


8.4(-ll) 


1.0(-10) 


2.3 


3.2 


1.7 


3.7 


1.1 


DR21(OH)-S 


< 8.0(-14) 


< 2.0(-ll) 


< 1.8(-13) 


7.8(-12) 


4.0(-10) 








2.0 


1.2 


NGC 7538S 


8.9(-14) 


1.3(-13) 


< 2.8(-13) 


3.4(-ll) 


4.5(-10) 


2.8 


2.6 




3.5 


1.9 


IRAS 23385+6053 


< 1.5(-13) 


< 1.5(-9) 


< 1.5(-10) 


3.4(-ll) 


9.4(-10) 








4.8 


1.0 



Notes. Table presents molecular abundances relative to H 2 and turbulent velocities derived from the global modelling method. Values are indicated 
with the form x{-y) = x X 10~ y . When a line emission is not detected, the upper limit assume a line velocity width similar to the SO2 one and 
refers to the abundance required for a 2cr rms signal. 
Two velocity components with different line widths. 



7. Correlations 

To investigate how the molecular line emission is related to the 
physical properties of the MDCs, we compare the line fluxes and 
abundances to the mid-IR (12 (xm) emission seen by MSX and 
IRAS. Previous studies identified a link bet ween mid-infrared 
emission and the physical e volution of MDCs (Ivan der Tak et al.l 
200fi lMarseiH e et al. 20081). To increase the sample s ize, we in- 
clude the data collected by Ivan der Tak et alj d2006l) for eight 
mIRb-MDCs. 

7.1. Method 

We compute the correlation factors p xy (see Appendix lAl be- 
tween the following parameters: mass M, luminosity L, thermal 
line width 5u (from the SO2 emission or, if not detected, ther- 
mal component of CH3OH), monochromatic luminosity Z42 at 
12 |xm, the molecular emission-line flux observed in CH3OH 
(masered A m , thermal A t , the ratio X m , and the abundance X mo i) 
and the abundance of H2O and HDO in sources where they are 
detected. Sources with upper limits to the 12/iin luminosity or 
the molecular emission-line flux are not used in the correlation 
analysis. Applying a two-tailed correlation test for N = 18 data 
points (or 7 for water), we reject correlation factors with val- 



Table 9. Abundance ratios. 



# 


Source 


HDO/H 2 


H 2 






xlO" 4 


xitr 8 


2 


IRAS 18089-1732 


33 


4.0 


6 


W43-MM1 


7 


230 


7 


DR21(OH) 


19 


4.3 


a 


W3 IRS5 


13 


30 


c 


W33A 


30 


80 


c 


AFGL2591 


3 


40 


f 


NGC 7538 IRS1 


38 


50 



Notes. Table presents HDO/H2O ratios and water abundance derived 
from H} s O. Values in the bottom part are taken from radiative transfer 
model results and Hi 8 column densities of lvan der Tak et aT] (12006). 



clear water abundance enhancement: results on W4 3-MM1 con- 
firm that it hosts a HM C (already detected by iMotte et all 
12001 iHeroin etaDl2009l) . and others on IRAS 18089-1732 and 
DR21(OH) may indicate the same. 
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DR21-0H 

— observation 

— thermal 

— maser 



Table 10. Correlation matrix for the sample and the CH3OH 
emission analysis. 




-10 



15 



-5 

[km/ s 



W43-MM1 

— observation 

— thermal 

— maser 



10 




100 
[km/ s 

Fig. 5. Extraction of the methanol class I maser emission (blue 
line) from observations (black line) by fitting the thermal emis- 
sion with a Gaussian profile at u^, in two sources: DR21(OH) 
(top) and W43-MM1 (bottom). 



ues smaller than 0.49 (respectively 0.76) in absolute value. We 
also use the method of partial correlations (see Appendix [Ab to 
search for biases that could be introduced by the sample selec- 
tion and extract the most probable real correlations, this allow- 
ing us to correct for false corr elations induced by real ones (see 
Rodgers & Ni cewandedl 19881 for a review of the subject). 



7.2. Search for biases in the sample 

In our systematic search for correlations, we first look for links 
between 'standard variables' (mass, luminosity, and thermal line 
width and luminosities at 12 (tin). Apart from the thermal line 
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widths, correlations are checked in logarithmic scales. To reli- 
ably compare our sources, we need to account for their different 
distances. To do so, we assume that they all are at a reference dis- 
tance of 1.7 kpc by applying a correction factor to the flux den- 
sities, i.e. measuring monochromatic luminosities at 12 \xn\ and 
comparable methanol maser emissions. No correction is made 
for beam dilution, as the infrared and maser emissions are known 
to originate in a region much smaller than the beam size (point- 
like emission). We search for correlations with 'standard vari- 
ables' regardless of whether there is a physical reason to expect 
a correlation: this search acts as a reality check on our method. 

Three relations are identified when applying the correlation 
method. First, a link seems to exist between the mass and the 
luminosity of the sources (pm.l = 0.50, thus 97.5 % of a chance 
of being real statistically). However, a partial correlation anal- 
ysis rejects this trend by giving a value under the confidence 
threshold (pml.Au = 0.36) when keeping Av constant. Second, a 
weak correlation may exist between thermal line velocity widths 
and the luminosity of the sources (pao.l = 0.49) but again par- 
tial correlation factors reduce this value to 0.34, by keeping the 
mass constant. This result has a 16.6 % probability of being ob- 
tained by chance, which is too high to claim that a link exists. A 
third correlation appears between luminosity and mid-IR emis- 
sion, with Pl,l u =0.80. This trend is clearly confirmed by partial 
correlation factors, indicating that we can be confident in this re- 
sult to a level of more than 99.9% (see Fig. [6}. 

7.3. Methanol correlations 

We show in Sect. [5] that two components can be extracted from 
the methanol line emission: a thermal (A t ) and a maser one (A m ). 
As these two components reflect two different characteristics of 
the sources, we try to find correlations between them and the 
intrinsic variables (mass, luminosity, etc.). We also test a possi- 
ble correlation with the ratio X m and the abundance X mo ]. The 
overall correlation matrix, summarizing the results, is given in 
Table[T0] 

The first correlation relates the mid-IR luminosities to the 
maser component {pA m ,L n = — 0.67). The correlation factor is 
negative, indicating an anti-correlation between these two vari- 
ables (see Fig|6]l. This trend is confirmed by the partial correla- 
tion analysis, reducing the correlation factor to -0.65 when the 
thermal component is assumed to be constant, and giving ap- 
proximately a 98.5% of chance of this correlation being real. 
We note that there is no significant correlation of the integrated 
(thermal + maser) CH3OH line flux in our data. A partial corre- 
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lation analysis demonstrates that the weakening of the CH3OH 
maser flux with increasing 12yum luminosity is not caused by 
increasing distance. 

In addition, we find two correlations associated with A t . The 
first one indicates that thermal CH3OH emission increases with 
the mass of the object (pa { ,m = 0.70, see Fig. [6j. The partial 
correlation analysis confirms this trend and, at its most extreme, 
decreases the correlation factor to 0.65 when L is assumed to 
be constant. There is a probability of lower than 0.4% that this 
trend is obtained by chance. The thermal emission is also linked 
to the maser emission, the correlation factor being equal to 0.76 
(see Fig Et- The study of partial correlation factors slightly de- 
creases the initial value to 0.75, giving a high chance (99.9% 
statistically) for this result being real. 

Three other links, all related to the ratio X m (see Table ITDb . 
are found but rejected by the partial correlation analysis, except 
the correlation with the maser emission which increases (up to 
0.96; this result is obvious as X m = A m /A t ). More precisely, par- 
tial correlation factors obtained for other correlations are well 
below the threshold, with \p\ = 0.27 at the maximum, hence 
there is more than a 27% chance of obtaining this result by luck. 
Thus, this ratio is interesting but a bit less relevant than A m . 



7.4. Water correlations 

We now consider the water abundances, includ ing numbers ob- 
tained in other MDCs ( Ivan der Tak et alj|2006h . We also search 
for possible correlations with the deuteration level D/H. Only 
the sources where water line emission is detected are included, 
hence 8 MDCs. As a consequence, the confidence threshold for 
a real correlation is now 0.756, assuming a two-tailed test (or 
two-cr test). 

First of all, the analysis of biases in this restricted sample 
reveals, again, a link between the mid-IR luminosity at 12 (j.m 
L\2 and the total luminosity L (see Table ITTb. Partial correlation 
factors confirm this trend with a final correlation factor of 0.79, 
hence 98% of chance of this relation being real. This is a strong 
indication that these two variables are correlated, even when a 
smaller sample of sources is chosen. 

Another correlation is found, between the mid-IR emission 
L\2 and the HDO abundance (pli 2 ,hdo = 0.80, see Table fTTT i. 
This is not confirmed by the partial correlation analysis, for 
which we measure a factor beneath the threshold when the H^O 
abundance is assumed to be constant at Pl p ,hdo,h 18 o = 

0.58, 

hence a 12.9% chance of having obtained this result with luck, 
statistically. 

The last connection that we find is between the HDO and 
H^O 0°hdoh 1s o - 0.95, see Table [TT1 and Fig.|6]l. This relation 
is strong and the partial correlation method confirms this trend 
by decreasing, at most, the correlation factor to 0.90 when the 
mid-IR flux density Z42 is assumed to be constant. This value 
indicates that our result is real in more than 99.8% of the cases. 
This correlation was already noted by eye (Table|8]l and confirms 
that the D/H ratio is of the same order of magnitude in all studied 
MDCs. 

More globally, our study illustrates that water abundance 
(hence detection) does not depend on a basic characteristic of 
HMPOs (such as their mass or luminosity) but needs further in- 
vestigations in each specific source. This point is discussed in 
Sect. 8.2. 



Table 11. Correlation matrix for the water emission analysis. 
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8. Discussion 

8. 1 . Methanol masers: shocks in embedded objects ? 

Our study reveals a clear link between the mid-IR brightness 
of th e 18 MDCs (including the sample by Ivan der Tak et al.l 
l2006h that we have observed and the methanol maser emis- 
sion at 84.5 GHz (class I maser). Unlike class II masers, 
which originate in static regions near massive protostars, class 
I masers are known to occur in extended zones, at some dis- 
tance from the star formation sites (~10 4 AU), w here molecu- 
lar ou t flows interact with the sur rounding medium dMinier et al.l 
120051) . ICvganowski et al.l (l2009h study 20 massive young stellar 
objects observed with interferometry to confirm this behaviour, 
and claim that methanol class I masers trace the molecular out- 
flows around them. 

Molecular outflows are indicators of an accretion process, 
and are known to decrease in intensity with time as the protostar 
evolves (low-mass case, see iBontemps et aDll996h . Presenting 
an early stage of massive star formation, most of MDCs har- 
bour powerful outflows. However, extraction of molecular out- 
flow strength is biased by the unknown orientation and the pos- 
sible source multiplicity, and no clear drop in strength can be ob- 
served between mlR q- and mIRb-MDCs dBeuther et alj|2002at 
iMarseille et al.l2008h . Furthermore, the decrease in power is sus- 
pected to occur at a later stage, i.e. in MDCs where an Hn region 
has developed. As a consequence, the variation in the methanol 
class I maser emission cannot be linked to a drop in the molec- 
ular outflow strength from mlRq to mIRb sources. Like class II 
masers, class I maser emission needs a very specific environ- 
ment, with a sufficient spatial extend. Moreover, the detection 
of multiple velocity components implies that the class I maser 
sites are localized at the interface between the molecular out- 
flows and the ambient gas. We can assume that mlRq sources 
contain more favourable zones where this kind of maser emis- 
sion is more likely to occur. 

The origin of the varying mid-IR brightnesses in MDCs re- 
mains quite unclear. On the one hand, modelling their spectral 
energy distribution shows that the mid-IR emission is critically 
dependent on the line of sight when molecul ar outflows dig 
a cav i ty from which IR emission can escape JRobitaill e et al.l 
I2006t Ivan der Tak et al.l 120061; IMarseille et all 120081) . Thus the 
variations in mid-IR brightness are orientation-dependent and 
do not reflect any difference in evolutionary stage. On the other 
hand, chemical models and observations of sulphur-bearing 
molecules or cold gas tracers such as N2H + seem to indicate 
that mid-IR brighter so urces are more evolved and warmer, rela- 
tive to more quiet ones (|Reid & Matthewsll2008t IMarseille et all 
l2008HHerpin et al.ll2009h . 
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Fig. 6. Correlation plots between (a) monochromatic luminosities at 12 pin and source luminosity, (b) monochromatic luminosities 
at 12 |j.m and maser component of CH3OH emission line, (c) masses of HMPOs and the thermal component of CH3OH emission 
lines, (d) the thermal and the maser component of CH3OH emission lines, (e) the maser-to-thermal emission ratio and the maser 
emission, (f) the molecular abunda nces of HDO and H^ 8 0. Sources of our sample are labelled with numbers (1 to 10), sources 
studied by Ivan der Tak et all d2006h are labelled with letters (a to h). Solid line : best linear fit obtained with a least squares method. 
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The anti-correlation that we find between the 12 |am lu- 
minosity and the maser emission makes the situation clearer. 
Indeed, the hypothesis that the variations in mid-IR brightness 
are due to orientation differences cannot explain this trend, as 
the maser emission is not affected by the line of sight direc- 
tion. Furthermore, the multiple velocity components would be 
more easily detected if the molecular outflows, i.e. the cavity 
from where the mid-IR is emitted, were to follow the line of 
sight (case of mIRb-MDCs). However, we observe the opposite 
trend. A valid interpretation defines mIRq-HMPOs as embed- 
ded objects, in which the envelope mainly interacts with out- 
flows. Here, the term 'embedded' must be defined to avoid con- 
fusion. Our study has found that the envelope masses of MDCs 
are not linked to their mid-IR brightness, hence are not suffi- 
cient to determine wether they are embedded or not. Taking into 
account the material distribution, the SED modelling of MDCs 
shows that their spherically symmetric distrib ution is respon- 
sible for their mid-IR emi ssion from MDCs dRobitaille et alj 
200$ iMarseille et afll2008h . In this sense, our study shows that 
mlRq-MDCs are embedded sources. Bright mid-IR objects may 
thus have more flattened envelopes than weak mid-IR ones. 
Fro m an observationa l point of view, this idea is supported by 
the iKurtz et al.l (12004 study of the IRAS 23385+6053 source 
where maser emission is detected around the source velocity (at 
-49.7 km-s -1 and -51.7 km-s -1 ). The conclusion of this study is 
that this observation can be explained by accretion shocks near 
the central object. This could explain also what we observe for 
W43-MM1, w here strong clues of infall toward this source have 
been detected dHerpin et al.ll2009l) . 

Nevertheless, an embedded phase does not imply that they 
correspond to an earlier stage of evolution. For example, this 
characteristic can be a direct consequence of the initial con- 
ditions of the MDC formation. However, our results support a 
scenario in which the initial collapsing clump, radially concen- 
trated, is disrupted by the molecular bipolar outflows during the 
accretion phase. Moreover, the mid-IR brightness and evolution- 
ary stage of MDCs are linked. 

8.2. The origin of water emission 

Previous res ults on HDO and H^ 8 Q d etections in mIRb-MDCs 
obtained by Ivan der Tak et al ] (l2006h demonstrated that these 
objects are not systematically strong H2O emitters. Our study 
has a similar detection rate (just below ~50 %) and does not 
show any evidence of an emission enhancement from mlRq to 
mIRb sources. This contradicts the hypothesis that warmer ob- 
jects should be stronger emitters. Indeed, a warmer environment 
should increase the release of water in the gas phase from the 
ice on dust grains. Thus, the origin of the strong line emission 
of water species in particular sources cannot be linked to the 
'quiet' versus 'bright' scheme. Each source must be treated in- 
dividually. 

The IRAS18089-1732 MDC is the most luminous object in 
the mlRq sample (3.2xl0 4 L ), and is also the source where 
the turbulence level is the highest (v? = 1-2 - 2.6 km-s -1 ). 
The discernible methanol class I maser emission in this object 
(X m = 1 .42) is indicative of interactions between the molecular 
outflows and the envelope. We have shown that the abundance of 
H2O and HDO is high enough to be detected (see Table |8j. We 
thus conlude that, in IRAS 18264-1152, the water abundance 
enhancement is caused by the large amount of energy avail- 
able from luminosity, gas turbulence, and molecular outflows 
(shocks). In addition, the detection of unexpected dimethyl-ether 
emission in the Hj 8 spectrum suggests that complex chemical 



components are released in the gas phase, including the water 
ice. We propose that the enhancement originates in a hot molec- 
ular core whose presence may also explain why water is detected 
in this source. 

In W43-MM1, many molecular rotational tra nsitions have 
been detected, covering a wide range of energy dHerpin et al.l 
120091) . The large amount of detected species, even unexpected 
complex molecules, are indicative of an extended hot molecu- 
lar core, and make this source of particular i nterest. Moreover , 
this region is susp ected to be in global infall dMotte et al.ll2.Q03b 
iHerpin et al.l2009h . This very likely explains the very high abun- 
dances (see Table 9) in this source: the ice on dust grains must 
have evaporated, increasing the water abundance to 2xl0~ 6 . 

Water line profiles towards DR21(OH) exhibit a double ve- 
locity component that may be linked to powerful bipolar out- 
flows, the water emission being produced by shocks between 
molecular outflows and the massive envelope. This is confirmed 
by the strong methanol class I maser emission (X m = 1.58), and 
wherein three components are identified, but the velocity compo- 
nents observed in H2O and HDO profiles differ from the velocity 
components of the maser emission (see Table|5]l. In addition, this 
scenario (increase in the amount of water species in the gas phase 
due to shocks) should apply to IRAS 18151-1208-MM2, the 
most powerful methanol maser emitter (X m = 13.9), but no HDO 
and Hi 8 O emission is detected. Hence, we conclude that shocks 
in DR21(OH) are not the origin of the abundance enhancement 
of water. DR21(OH) is one of three sources where unexpected 
detections of dimethyl-ether have been observed, showing that a 
hot molecular core takes place there. This is the best argument 
to explain the water abundance in this source. 

On the other hand, HDO is detected in the source 
NGC 7538S, where there is no indication of a hot core. 
Furthermore, this object is not a strong methanol class I maser 
emitter (X m = 0.21), showing that no strong shocks occur in- 
side it. The origin of the water detection is then more difficult 
to explain. In addition, our modelling of NGC 7538S shows that 
the water abundance is not particularly high compared to upper 
limits derived for other sources (see Table |H), even if this MDC 
has the highest temperature of the sample, well above the others 
((T) = 38.7 K). This high temperature is caused by the small 
size of this source (0.05 pc) combined with its high luminosity 
(1.3xl0 4 L ). Therefore, we suppose that the emission of wa- 
ter species in NGC 7538S, while not as abundant as those in 
hot cores, is stronger due to this high temperature. However, the 
derived abundances are consistent with an abundance jump of 
lxlO 4 in the hottest part of NGC 7538S. Indeed, we obtained 
low but almost equivalent values when considering either low or 
high energy level transitions (~ lxl0~ 13 ). Another possibility 
would be that NGC 7538S harbors a brand new hot core. In that 
case, primal ices forming on dust grains in the atomic phase of 
diffus e clouds are strongly deuterated by the fo rmation mecha- 
nism dLipshtat et al.l 12004 ICazaux etaT1l2008b . This could ex- 
plain why HDO is detected while Hi O is not, these primal ices 
being first desorbed when a hot core starts to develop. 

Our study of a large sample of MDCs permits us to con- 
clude that the water detection in these objects is not linked to 
their mid-IR brightness, i.e. a different evolutionary stage or a 
different distribution of materials. In addition, it shows that wa- 
ter detection is associated with a hot molecular core. This seems 
reasonable as in the gas phase hot cores are known to release 
complex molecules that are detected, while ice on dust grains 
evaporates. In rare cases, such as NGC 7538S, water line emis- 
sion may be detected due to a globally high temperature in the 
source. 
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8.3. mIRq vs mIRb: review of common and different 
properties 

Several clues show that mIRq- and mIRb-MDCs have similar 
properties. First, previous work on large samples of MDCs in- 
dicate that the velocity widths of emission lines are dominated 
by gas turbulence (vj = 0.8 - 2.9 km-s -1 ), whos e correspond- 
ing width is much higher than the thermal width dBeuther et alj 
l2002bHMotte et al.ll2007h . Our observations confirm this result, 
and that this high turbulence may be the consequence of the ini- 
tial cloud collapse, as su spected in large-s cale filaments (e.g. 
DR21 in Cvgnu s X. see iMotte et al.l 120071 or the Ophiuchus 
main cloud, see I Andre et al.l l2007lT ln this context, the turbu- 
lence is supposed to dissipate by means of ambipolar diffusion 
while the MDC evolves, lea ding to a quasi-static collapse (see 
IWard-Thompso n et al. 2007, for a review). Our study shows that 
the turbulence does not vary between mIRq- and mIRb-MDCs, 
indicating that their evolutionary stages are similar. We also note 
that the range of masses in our sample (200-2000 M ) has no 
impact on the mid-IR brightness of the sources. To first order, 
the most massive sources should have a higher opacity at mid- 
IR wavelengths. Our study contradicts this simple view. The 
emission process of mid-IR is complex in MDCs, and must in- 
clude other parameters. Previous work on the SED modelling of 
MDCs sh ow that the density distribution plays a maj or role in 
this issue dRobitaille et al.ll2006tlMarseille et al.ll2008l) . 

Our work has revealed that the water abundance in MDCs 
is independent of their mid-IR brightness. The detection ratio 
in mIRq and mIRb sources is the same (slightly below 50%). 
Water abundances derived are similar, with a wide range of val- 
ues in the external cold parts of the objects (between ~lxl0~ 13 
and ~lxlO" 9 ). From our study, we conclude that most objects 
for which water species are detected contain a more or less ex- 
tended hot molecular core. Finally, we have found that mIRq 
and mIRb sources share the sam e deuteration ratio D/H ~(5- 
40)xl0 4 . As already reported in Ivan der Tak et all (|2006), this 
value is one hundred times higher than the interstellar ratio, and 
confirms that the molecular emission observed is coming from 
evaporated ices. 

These results illustrate the difficulty in differentiating be- 
tween the mIRq and mIRb classes. The origin of this distinc- 
tion is linked to the class and c lass I definitions for low- 
mass protostars dAndre et al.l 1 1993b . where the mid-IR lumi- 
nosity is compared to the overall luminosity of the sources. 
A ratio Li<25/jm/iboi < 0.3% indicates a class object, and 
a class I object in the opposite case. This definition was ex- 
tended to MDCs where a similar behaviour at mid-IR wave- 
lengths was observed, and 'quiet' sources were suspected to 
represent an earlier stage of the massive star formation evo- 
lution. However, it appears more and more clearly that di- 
rectly applying definitions from low-mass protostars to high- 
mass star-forming regions is unwise and may not be adapted 
to correctly transcribe different evolutionary stages. This is 
caused by the multiplicity of MDCs, that are now identi- 
fied as massive proto-clusters, but also to the presence of 
powerful ou tflows , hot molecular c ores and suspected infall 
(IMotte et alj|2003l:ivan der Tak et al.ll2006HLeurini et al.ll2007bt 
Ward-Thompson et al.ll2007t iMarseille et alj|2008t iHerpin et alj 
2009). Furthermore, we show that the observed variation in mid- 



IR brightness seems to be associated with the matter distribution 
around the protocluster, being more or less spherically symmet- 
ric. 

Interpreting mIRq sources as more embedded objects sup- 
ports the idea that they represent an earlier stage of the massive 



star formation. Indeed, with the hypothesis that MDCs start in a 
deeply embedded phase where matter is then flattened by pow- 
erful outflows, as the methanol class I maser trend seems to il- 
lustrate, the evolutionary sequence from mIRq- to mIRb-MDCs 
is still valid. 



9. Conclusions 

We summarize the main conclusions of our study: 

1. Using a correlation method, we have derived a link between 
the methanol class I maser emission and the mid-IR luminos- 
ity of the MDCs that we observed. We propose that this link 
can be explained by the mIRq sources being more deeply 
embedded, i.e. having a more spherically symmetric distri- 
bution of their envelope material, creating a favourable envi- 
ronment for the creation of these masers. 

2. The detection of line emission from HbO and HDO is linked 
to a hot molecular core, independent of the mid-IR bright- 
ness. 

3. The study of the overall sample has demonstrated that MDCs 
have many common properties: their size (0.1 pc), mass 
(~ 200 - 2000 M ), luminosity (~ lxlO 4 L ), turbulent ve- 
locity of the gas (vj = 1-3 km-s -1 ), incidence of hot cores 
(< 50%), and deuteration level (D/H ~ 5 - 40xl0 -4 ). This 
implies the idea that these objects can be treated as a single 
class, and that they constitute a cluster of protostars, some of 
them being massive. 

4. The classification of MDCs is complex due to their mul- 
tiplicity and the multiple physical processes (independent 
of their evolutionary stage) that occur inside, which have a 
strong impact on observations, i.e. hot molecular core pres- 
ence, and powerful outflows, suspected collapse. We finally 
suggest that the mid-IR 'quiet' and 'bright' classification 
needs more investigation but is still relevant when we assume 
that mIRq sources represent an earlier stage of massive star 
formation than mIRb ones. 

In the near future, further investigation of the mIRq ver- 
sus mIRb-MDCs evolutionary stage problem will be possible, 
particularly by means of direct water line detections with the 
Herschel-HIFI instrument and the high-angular resolution of 
present (IRAM, SMA, CARMA) and future (ALMA) interfer- 
ometers. These may provide us with the final arguments permit- 
ting us to find a coherent classification, by age, of the MDCs, 
hence allowing us to describe a complete evolution scenario 
leading to the formation of massive stars. 

Acknowledgements. We thank the editor, Malcolm Walmsley, for many com- 
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Appendix A: Elements of Correlation theory 

For more detail and a global review of co rrelation factor usage, 
we refer to lRodgers & Nicewanderl dl988l) . 



A.1. Correlation factor definition 

Given two variables x and y with N elements x, and y,, the cor- 
relation factor p Ky between these two variables is measured to 
be 



Px,y 



where 



1=1 

N 2 cr,o- v ' 



1 N 

i=l 

1 N 



(A.l) 

(A.2) 
(A.3) 



A correlation factor has a value between -1 and 1, a positive 
value indicating a correlation between the two variables and a 
negative value indicating an anti-correlation. If two variables 
are distributed randomly following a normal law, the density of 
probability P(p) of finding a correlation factor between p and 
p + dp is given by 



Pip) 



1 



(i-p 



2x(y-2)/2 



(A.4) 



where F is the classic gamma function, and v — N - 2 where N 
is the number of elements (see Fig. lA.ll for plots at different N). 
Fhus, the absolute value of the correlation gives, as a function 
of the number of elements N, a percentage of the distribution 
(Xi,yf) obtained by chance. Fhe higher the absolute value of the 
correlation factor is, the lower this percentage. As an example, a 
correlation factor of -0.5 obtained with 20 elements has a good 
chance (97.5 %) of being real. With only 6 elements, the same 
correlation factor has a lesser chance (68.8 %) of being real. 

One can note that the P(p) function peaks more and more 
around p — while N increases. More precisely, the standard 
deviation of the P(p) distribution is given by 



-<P l > - <P> 1 



1 



N- 1 



(A.5) 



It is then common practise to perform a two-tailed (two-sigma) 
test to reject the null-hypothesis, i.e. that the (x,,y,) points does 
not follow a distribution of two random variables. This permits 
us to define a correlation factor threshold p t equal to 2<x to claim 
wether a correlation has a good chance of being real or not 



[pi > Pt = 



(A.6) 



This formula justifies that N > 6, which is a necessary criterion 
to obtain a threshold lower than 1 . In the case of our study, we 
used the two-sigma test to discriminate between true and false 
correlation. 



A.2. Partial correlations 

When studying multiple variables at the same time, it is very of- 
ten useful to have a clear view of the links between the variables, 
and first determine wether they exist or not. Existence of these 
links can be identified by determining the correlation factors of 
pairs of variables (see section above). 
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For a large set of variables {X,}, all the links obtained by 
standard correlations factors px u Xj must be verified with a sys- 
tematic checking of all partial correlations associated with it, i.e. 

Xj,x kt{i ji must be above the threshold of confidence and have 
the same sign as px h x . . 



-1 -0.3 0.5 1 

(a) 




(c) 

Fig. A.l. Plots of the P(p) with different number of elements on 
the sample studied: (a) with N = 10, (b) with N = 25, and (c) 
with N = 100. 



Unfortunately, the use of correlation factors restricts the 
study to pairs of variables, which can lead to the identification 
of false links, i.e. two variables that seems to be correlated when 
a third variable is the true origin of this link. One can take the 
example of a sample of pupils of all ages. A raw study of cor- 
relations between their ages, weights, and scientific levels will 
certainly make establish a false link: between the weight and the 
scientific level. Indeed, we can easily understand that their ages 
are at the origin of their growth, physically and intellectually. For 
a sample in which this logical deductions would be impossible, 
how can we distinguish true from false correlations ? 

The use of partial correlations permits us to avoid this issue. 
Giving a set of three variables x, y, and z, partial correlation p^ A 
derives the correlation factor between two variables (for example 
x and y), assuming that the third one z remains constant during 
the measurements. The partial correlation factor is then given by 

Pxy-PxzPyz 

Pxyx = i ( A -7) 
p - p%)(l - pj z ) 

If the partial correlation factor obtained is below the confidence 
threshold chosen or has a different sign from the standard corre- 
lation factor, then the link between the two variables is false. 
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